domain agnostic measure
A Domain Agnostic Measure for Monitoring and Evaluating GANs
Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as they behave non-intuitively. We leverage the notion of duality gap from game theory to propose a measure that addresses both (i) and (ii) at a low computational cost. Extensive experiments show the effectiveness of this measure to rank different GAN models and capture the typical GAN failure scenarios, including mode collapse and non-convergent behaviours. This evaluation metric also provides meaningful monitoring on the progression of the loss during training. It highly correlates with FID on natural image datasets, and with domain specific scores for text, sound and cosmology data where FID is not directly suitable. In particular, our proposed metric requires no labels or a pretrained classifier, making it domain agnostic.
Reviews: A Domain Agnostic Measure for Monitoring and Evaluating GANs
The idea of studying GANs from the game theory perspective is not new; however, using the duality gap as a performance metric (some sort of divergence between the generated data distribution and the real data distribution) is original to the best of my knowledge. The paper is written clearly. In terms of significance, while the idea of the duality gap is "natural" when considering the game theory perspective for GANs, it is not clear why this is a good metric for _any_ domain. The authors imply that it is a good idea to find a metric that does not depend on the domain of the data, but given all the parallels between GANs and the different divergences between probability distributions (JS, Wasserstein, etc.) I think the main problem is to find a metric that can be thought as correctly modeling the distance between high-dimensional datasets such as the ones given by images. In that case, modeling this aspect (which is highly domain-dependent) is crucial for understanding what a GAN is capturing about the data distribution.
A Domain Agnostic Measure for Monitoring and Evaluating GANs
Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as they behave non-intuitively. We leverage the notion of duality gap from game theory to propose a measure that addresses both (i) and (ii) at a low computational cost. Extensive experiments show the effectiveness of this measure to rank different GAN models and capture the typical GAN failure scenarios, including mode collapse and non-convergent behaviours.
A Domain Agnostic Measure for Monitoring and Evaluating GANs
Grnarova, Paulina, Levy, Kfir Y., Lucchi, Aurelien, Perraudin, Nathanael, Goodfellow, Ian, Hofmann, Thomas, Krause, Andreas
Generative Adversarial Networks (GANs) have shown remarkable results in modeling complex distributions, but their evaluation remains an unsettled issue. Evaluations are essential for: (i) relative assessment of different models and (ii) monitoring the progress of a single model throughout training. The latter cannot be determined by simply inspecting the generator and discriminator loss curves as they behave non-intuitively. We leverage the notion of duality gap from game theory to propose a measure that addresses both (i) and (ii) at a low computational cost. Extensive experiments show the effectiveness of this measure to rank different GAN models and capture the typical GAN failure scenarios, including mode collapse and non-convergent behaviours.